图形卷积网络(GCN)是最受欢迎的体系结构之一,用于解决分类问题,并附有图形信息。我们对图形卷积在多层网络中的影响进行了严格的理论理解。我们通过与随机块模型结合的非线性分离高斯混合模型的节点分类问题研究这些效果。首先,我们表明,单个图卷积扩展了多层网络可以至少$ 1/\ sqrt [4] {\ Mathbb {e} {\ rm veg对数据进行分类的均值之间的距离。 }} $,其中$ \ mathbb {e} {\ rm deg} $表示节点的预期度。其次,我们表明,随着图的密度稍强,两个图卷积将此因素提高到至少$ 1/\ sqrt [4] {n} $,其中$ n $是图中的节点的数量。最后,我们对网络层中不同组合的图形卷积的性能提供了理论和经验见解,得出的结论是,对于所有位置的所有组合,性能都是相互相似的。我们对合成数据和现实世界数据进行了广泛的实验,以说明我们的结果。
translated by 谷歌翻译
Graph-based learning is a rapidly growing sub-field of machine learning with applications in social networks, citation networks, and bioinformatics. One of the most popular models is graph attention networks. They were introduced to allow a node to aggregate information from features of neighbor nodes in a non-uniform way, in contrast to simple graph convolution which does not distinguish the neighbors of a node. In this paper, we study theoretically this expected behaviour of graph attention networks. We prove multiple results on the performance of graph attention mechanism for the problem of node classification for a contextual stochastic block model. Here the node features are obtained from a mixture of Gaussians and the edges from a stochastic block model. We show that in an "easy" regime, where the distance between the means of the Gaussians is large enough, graph attention is able to distinguish inter-class from intra-class edges, and thus it maintains the weights of important edges and significantly reduces the weights of unimportant edges. Consequently, we show that this implies perfect node classification. In the "hard" regime, we show that every attention mechanism fails to distinguish intra-class from inter-class edges. We evaluate our theoretical results on synthetic and real-world data.
translated by 谷歌翻译
Gradient-based first-order convex optimization algorithms find widespread applicability in a variety of domains, including machine learning tasks. Motivated by the recent advances in fixed-time stability theory of continuous-time dynamical systems, we introduce a generalized framework for designing accelerated optimization algorithms with strongest convergence guarantees that further extend to a subclass of non-convex functions. In particular, we introduce the \emph{GenFlow} algorithm and its momentum variant that provably converge to the optimal solution of objective functions satisfying the Polyak-{\L}ojasiewicz (PL) inequality, in a fixed-time. Moreover for functions that admit non-degenerate saddle-points, we show that for the proposed GenFlow algorithm, the time required to evade these saddle-points is bounded uniformly for all initial conditions. Finally, for strongly convex-strongly concave minimax problems whose optimal solution is a saddle point, a similar scheme is shown to arrive at the optimal solution again in a fixed-time. The superior convergence properties of our algorithm are validated experimentally on a variety of benchmark datasets.
translated by 谷歌翻译
Power grids, across the world, play an important societal and economical role by providing uninterrupted, reliable and transient-free power to several industries, businesses and household consumers. With the advent of renewable power resources and EVs resulting into uncertain generation and highly dynamic load demands, it has become ever so important to ensure robust operation of power networks through suitable management of transient stability issues and localize the events of blackouts. In the light of ever increasing stress on the modern grid infrastructure and the grid operators, this paper presents a reinforcement learning (RL) framework, PowRL, to mitigate the effects of unexpected network events, as well as reliably maintain electricity everywhere on the network at all times. The PowRL leverages a novel heuristic for overload management, along with the RL-guided decision making on optimal topology selection to ensure that the grid is operated safely and reliably (with no overloads). PowRL is benchmarked on a variety of competition datasets hosted by the L2RPN (Learning to Run a Power Network). Even with its reduced action space, PowRL tops the leaderboard in the L2RPN NeurIPS 2020 challenge (Robustness track) at an aggregate level, while also being the top performing agent in the L2RPN WCCI 2020 challenge. Moreover, detailed analysis depicts state-of-the-art performances by the PowRL agent in some of the test scenarios.
translated by 谷歌翻译
秋季和春季的寒冷温度有可能对葡萄藤和其他水果植物造成霜冻损害,这可能会大大降低收获产量。为了防止这些损失,农民在判断可能造成损失时,采取了昂贵的霜冻缓解措施,例如洒水装置,加热器和风车。然而,这种判断是具有挑战性的,因为植物的冷耐度在整个休眠期间变化,并且很难直接测量。这导致科学家开发了基于费力的现场测量数据,可以将其调整为不同的葡萄品种。在本文中,我们研究了深度学习模型是否可以基于在30年期间收集的数据来改善葡萄的冷坚硬预测。一个关键的挑战是,每个品种的数据量高度可变,有些品种只有少量。为此,我们研究了多任务学习来利用各种品种的数据,以提高个人品种的预测性能。我们评估了许多多任务学习方法,并表明,性能最高的方法能够显着改善单个品种的学习,并优于大多数品种的当前最新科学模型。
translated by 谷歌翻译
这项研究开发了一个固定时间收敛的鞍点动力学系统,用于在标准凸孔腔假设的放松下解决最小值问题。特别是,通过利用优化算法的动力学系统观点,可以获得加速到鞍点的收敛。而不是要求目标函数是强率 - 巧妙的concave(由于需要加速几个鞍点算法的加速收敛),而是保证仅满足双面Polyak的功能,可以保证均匀的固定时间收敛性 - {\ l} ojasiewicz(pl)不等式。已知大量的实际问题,包括可靠的最小二乘估计,可以满足双面PL不平等。与任何其他具有线性甚至超级线性收敛的最先进方法相比,所提出的方法可实现任意快速的收敛性,并且在数值案例研究中也得到了证实。
translated by 谷歌翻译
基于骨架的识别系统正在获得流行,并在骨骼中关注点或关节的机器学习模型已被证明在机器人技术等许多领域具有计算有效和应用。很容易跟踪点,从而保存空间和时间信息,这在抽象所需信息中起着重要作用,分类成为一项容易的任务。在本文中,我们旨在研究这些要点,但使用云机制,在该机制中我们将云定义为点的集合。但是,当我们添加时间信息时,可能不可能检索每个帧中一个点的坐标,而不是专注于单个点,我们可以使用k-neighbors来检索所讨论的观点的状态。我们的重点是使用重量共享收集此类信息,但请确保当我们尝试从邻居那里检索信息时,我们不会随身携带噪音。 LSTM具有长期建模功能,并且可以携带时间和空间信息。在本文中,我们试图总结基于图的手势识别方法。
translated by 谷歌翻译
AI目标通常很难正确指定。有些方法通过规范AI的副作用来解决此问题:代理必须用不完美的代理目标来权衡“他们造成了多少混乱”。我们通过援助游戏框架提出了一个正式的副作用正规化标准。在这些游戏中,代理解决了一个可观察到的马尔可夫决策过程(POMDP),代表了其对其应优化目标函数的不确定性。我们考虑在以后的时间步骤向代理揭示真正目标的设置。我们证明,通过将代理人奖励与代理商实现一系列未来任务的能力进行交易,可以解决此POMDP。我们通过在两个环境环境中的地面真相评估来证明问题形式化的合理性。
translated by 谷歌翻译
心理治疗干预技术是治疗师和患者之间的多方面对话。与一般的临床讨论不同,心理治疗的核心成分(即症状)很难区分,因此成为以后要总结的复杂问题。结构化的咨询对话可能包含有关症状,心理健康问题病史或发现患者行为的讨论。它还可能包含与临床摘要无关的讨论填充单词。我们将结构化心理治疗的这些要素称为咨询组成部分。在本文中,目的是心理健康咨询的摘要,以基于领域知识并帮助临床医生快速收集意义。在注释咨询组件的12.9k话语和每次对话的参考摘要之后,我们创建了一个新的数据集。此外,我们建议消费是一种新颖的咨询组件指导摘要模型。消费经历三个独立模块。首先,为了评估抑郁症状的存在,它使用患者健康问卷(PHQ-9)过滤了话语,而第二和第三模块旨在对咨询组件进行分类。最后,我们提出了针对特定问题的心理健康信息捕获(MHIC)评估指标,用于咨询摘要。我们的比较研究表明,我们改善了性能并产生凝聚力,语义和连贯的摘要。我们全面分析了生成的摘要,以研究心理治疗元素的捕获。摘要的人类和临床评估表明,消费会产生质量摘要。此外,心理健康专家验证了消费的临床可接受性。最后,我们讨论了现实世界中心理健康咨询摘要的独特性,并在Mathic.ai的支持下显示了其在线应用程序上的部署的证据
translated by 谷歌翻译
癌症是人体内部异常细胞的无法控制的细胞分裂,可以蔓延到其他身体器官。它是非传染性疾病(NCDS)和NCDS之一,占全世界总死亡人数的71%,而肺癌是女性乳腺癌后第二次诊断的癌症。肺癌的癌症生存率仅为19%。有各种方法用于诊断肺癌,如X射线,CT扫描,PET-CT扫描,支气管镜检查和活组织检查。然而,为了了解基于组织型H和E染色的肺癌亚型,广泛使用,其中染色在从活组织检查中吸入的组织上进行。研究报道,组织学类型与肺癌预后和治疗相关。因此,早期和准确地检测肺癌组织学是一种迫切需要,并且由于其治疗取决于疾病的组织学,分子曲线和阶段的类型,最重要的是分析肺癌的组织病理学图像。因此,为了加快肺癌诊断的重要过程,减少病理学家的负担,使用深层学习技术。这些技术表明了在分析癌症组织病变幻灯片的分析中提高了疗效。几项研究报告说,卷积神经网络(CNN)在脑,皮肤,乳腺癌,肺癌等各种癌症类型的组织病理学图片的分类中的重要性。在本研究中,通过使用Reset50,VGG-19,Inception_Resnet_V2和DenSenet进行特征提取和三重态丢失来引导CNN以引导CNN,以引导CNN,以引导CNN使得其增加群集间距离并减少集群内距离。
translated by 谷歌翻译